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ABSTRACT 

The Bayesian decision method has several features which 
are desireable in the sampling inspection process for quality 
control. These features include: (1) comparison of the value 
of sample information in the decision process with the cost 
of obtaining the information; (2) basi n g deci s i ons on their 
consequences to the decision maker; and (3) allowing the use 
of subjective information in the decision process. In this 
paper the Bayesian decision procedure as it applies to 
variables sampling for quality control is examined. The 
basic method is developed for both simultaneous and sequential 
sampling and the modeling of decision consequences is dis- 
cussed. Various models for the production process are 
provided and solutions for the generalized linear model 
obtained. Finally the incorporation of subjective information 
is discussed. 
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I. INTRODUCTION 



One of the primary concerns in the quality control of 
items produced by or received from a production line are the 
procedures by which decisions are made concerning the quality 
of the material produced. In order to develop a decision 
procedure the abstract term "quality" must be operati onal ly 
defined. This definition usually takes the form of an equip- 
ment specification which lists the characteri sti cs required 
of the unit to perform its intended function. In the case of 
100% inspection every unit produced or received is subjected 
to a test in which these characteri s ti cs are measured. Based 
on the test results the decision procedure is to accept those 
units which satisfy the specification and reject those which 
do not. When sampling inspection is used, a sample of the 
product ion output is tested. The test results of the sample 
are then used to make and accept or reject decisions con- 
cerning the population or lot from which the sample was drawn. 

The decision procedure in this case requires that a 
decision function be specified which indicates, for the test 
results observed, the decision to be made (accept or reject). 
Unlike 100% inspection, in sample testing there always exists 
the possibility that the decision function may indicate an 
erroneous decision. 
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Tile consequences of erroneous decisions represent a 
loss to the decision maker and can range from mild to severe. 
As an example suppose a machine is judged to be out of 
calibration when in fact it is not. Then the loss to the 
decision maker would be the cost of a needless recalibration 
which may be small. On the other hand if production quality 
were judged to be acceptable when in fact it was not, 
consumers might seek alternate sources of supply. This could 
result in the loss of entire production contracts and reputa- 
tion. Although the above examples are oversimplified the 
main idea is that erroneous decisions always represent a loss 
to the decision maker. Thus in the design of a decision 
procedure the loss due to erroneous decisions must be 
cons i dered . 

Another consideration in the design of a decision 
procedure is the cost of testing the sample units. This 
cost includes the labor and test facilities required and may 
include the cost of the units themselves (if the tests are 
destructive) or repair costs if the units fail. These costs 
may be large if complex facilities are required or test time 
is long. If the cost of testing is larger than the antici- 
pated consequences of a decision then the cost of information 
is greater than its value in the decision process. Under 
these circumstances, gathering further information (testing) 
is counterproducti ve . Thus a decision procedure should 
indicate the "value 1 of additional information to the decision 
maker . 
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The most common method of specifying a decision 
procedure is based on classical hypothesis testing. In this 
approach two points on the operating characteristic (OC) 
curve for the decision procedure are specified. The OC curve 
for the procedure is the probability of acceptance versus 
equipment quality. The required sample size and reject/accept 
criteria are then developed based on the sampling distribution 
using a likelihood ratio test. Another method which provides 
greater flexibility and has features absent in the classical 
method is the Bayesian decision approach. In the Bayesian 
method the decision procedure is optimized for a loss function 
specified by the user which reflects this particular applica- 
tion. If the loss function and the cost of testing are 
expressed in the same units the cost of information can be 
obtained. The Bayesian method also contains the classical 
procedure as a special case. Another feature of the Bayesian 
method is that specific knowledge of the behavior of the 
production process as well as subjective information can be 
incorporated thus allowing the decision procedure to adapt to 
changing requi rements . 

In the following sections the basic Bayesian decision 
procedure will be outlined and the specification of loss 
functions and models for the production process discussed. 

The generalized linear model is introduced and the recursion 
equations developed to facilitate calculation of posterior 
distributions. Finally, the i ncorporati on of subjective 
information is discussed. 
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II. THE BAYESIAN METHOD 



A. THE GENERALIZED BAYESIAN DECISION PROCEDURE 

In order to discuss the Bayesian method as applied to a 
production line, a generalized model of the production and 
sample test process is required. Let e represent the charac- 
teristic of the equipments upon which decisions are to be 
made. For example e would be the average or mean gain of a 
production lot of amplifiers. The actual value of 9 is not 
observable, however, we can perform tests which indicate the 
gain of an individual amplifier. Let x indicate the results 
of such test. Also let e and x be related through a known 
probability density function denoted by f(x)|e). As a model 
of the generalized production process we assume a random 
process such that for each time t, has continuous distri- 
bution. It is also assumed that there exists a time increment 
A t> 0 for which 9 is constant i.e., 9 t = 0 t+A _ for all t. 

This assumption implies that given a production increment of n 
items produced during At, test results for each unit are 
samples from f ( x 1 9 1 ) . Figure 1 shows how 9 might vary for the 
generalized process. As shown in the figure, 9 for the 
increment At is a fixed but unknown quantity. It is the units 
produced during each At which are the object of the decision 
process. At each end point t^ , t^, ... a decision must be 
made to either accept or reject the units produced in the 
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Figure 1. Generalized Production Process. 



preceding time increment. The decisions will be based on the 
sample data (x) which provides information concerning the 
true value of 9 . At time tQ let the probability density 
f( 9 Q ) represent the uncertainty as to the true value of 9 . 
Since the value of 9 at time t may or may not be independent 
of the previous values of e ( 9 1 ,, 0^ ^ •••) let the uncer- 
tainty of © t given ( 9 1 _ -j , 9 t _£ •••) be expressed by means of 

the conditional density f( 9 t | 0 t _-|, 0 t _ 2 •••) which is known 
or can be estimated but not necessarily the same for all t. 
Since the decisions (accept or reject) are to be based on the 
characteri s ti c 9, it is necessary to specify a loss function 
which indicates the consequences of a particular decision 
when a specific value of 9 obtains, for all possible values 
of 9 and all decisions. Let 0 represent the set of all 

possible values of 9 and let D represent the set of all 

decisions d. Then let L(e,d) be the loss function which is 
known for all deD and 9 e 0 . The loss for a particular decision 
deD depends on the actual value of 0, but 9 is unknown. The 
expected loss of decision d would be the product of L(e,d) 
times the probability that 9 obtains, summed over all possible 
values of 9. Let p(d) denote the expected loss or risk of 
decision d then: 

p(d) = J* L ( 0 , d ) f ( 9 ) d 9 . ( 1 ) 

0 
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The optimal decision in terms of minimizing the risk is the 
decision deD which minimizes equation (1) and is denoted by 
d* therefore: 



p (d* ) = min f L(e,d) f(e)de . 
deD e 



( 2 ) 



Thus given a loss function L ( e , d ) and the distribution of 
e, f ( e ) , the optimal decision is defined by equation (2). 

The optimal decision (d*) is usually referred to as the Bayes 
decision against f(0). Since the decisions (accept or reject) 
wTll be based on the sample data (x.:[x-|,X 2 » ..., x n ]) it is 
desirable to specify a decision function, denoted by s(x_), 
which for every value of x_ observed, specifies the decision 
d*, i.e., the decision which minimizes the risk. Let S n be 
the set of all possible test results. Then the risk of the 
decision function 6 (x ) is by equation (1) 



We want to find the decision function S*(x_) which minimized 
the risk as expressed in equation 3. Assuming L ( e , 5 ( x_) ) 1 s 
bounded, interchanging the order of integration yields: 



Equation (4) is minimized when the expression in brackets is 

minimized for each value of x_eS n . Thus the optimal decision 

function <5*(x_) would specify a decision d* which minimized 
the integral 
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(4) 
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J L ( e , d ) f (x | 0 ) f ( e ) de . 



(5) 



0 



From Bayes theorem f(e|x_) = ^ — 



( 6 ) 



where f(x_) = |e)f(e)de is a constant given a sample 

0 




value for x_. Then minimizing the integral of (5) is equiva- 
lent to selecting a decision d* which minimizes: 



for each value of x_ observed. Thus it is not necessary to 
determine in advance a decision function <$(x_) which specifies 
d* for all possible values of x_. As each x_ is observed the 
posterier, f(e|x_) 5 is calculated from the prior, f(e), by 
equation (6) and d* is chosen to satisfy 



This is the same result as equation (2) except that the 
posterior based on the test data x is used instead of the 
prior. Thus equation (2) defines the optimal decision d* 
before and after sampling as long as the appropriate value for 
f ( 0 ) is used. 

The next step is to determine whether the decision should 
be made without sampling based on the prior distribution, 
f ( 0 ) , or the sample tested and the decision based on the 
posterior distribution f(e|x_). In order to determine which 
action is optimal the risk of 'obtaining an additional sample 




0 ' 0 



(7) 




(8) 
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and then proceeding in an optimal fashion must be obtained. 

If the risk of selecting a decision immediately is greater 
than the risk of obtaining a sample result and then proceeding 
in an optimal manner, then the sample should be tested since 
this is the minimum risk action. Let p(<f>,_x) denote the risk 
of obtaining a sample x when the prior of 9 i s <p and then 
proceeding in an optimal manner. Also let p($,d*) denote the 
risk of making decision d* when the prior of 0 is <j> . Then 
the following decision rule will be used. 

If p($,d*) > p(i>,x) test the sample; (9) 

otherwise, make decision d*. p(<f>,d*) is obtained from 

equation (2) where <t> = f(e) is: 



To determine p(<j>,x) two cases must be distinguished; the 
samples are tested si mutaneous ly or sequential sampling is 
used . 

1 . Simultaneous Sampling 



sample results is not zero. Therefore let C n (_x) represent the 
cost of testing the units 1, 2, ..., n where test results are 
represented by the vector x = (x-j , > •••> x n ). In man Y 

cases the cost of testing is independent of the values 
obtained, in which case the cost would be just C n> the cost of 
testing n units. The expected loss of testing n units and 
then making the optimal decision d* plus the cost of testing 




( 10 ) 



It has been assumed that the cost of obtaining 
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is p(<j>,x). The distribution of e if x is observed will be 
f(e|x) thus from equation (10): 



pU.x) = J J^min J L ( e , d ) f ( e | x ) dej f(x)dx + E[C n (x)] (11) 
S 0 



Thus the decision procedure for the production lot would be 
as follows: 

1. Prior to testing determine p(<f>,d*) from equation 
(10) based on the prior f(e). 

2. Determine the risk of testing, p($,x_), from 
equation (11). 

3. If p(<j>,d*) < p ( <{> , x_) make decision d* otherwise. 

4. Test sample units to obtain data x_ and make 
decision d* according to equation (8). 

2 . Sequential Sampling 

The risk of sampling in a sequential procedure 
differs from the simultaneous case because after a sample is 
tested two actions are possible, (a) make a decision or (b) 
continue sampling. Because of this, determining the risk of 
a sequential procedure is, in general, more difficult than 
determining the risk of the simultaneous case just discussed. 
In most cases of practical interest the sample size has a 
fixed upper bound. Let n denote the maximum number of samples 
available for testing. Then the risk of testing the first 
unit and proceeding in an optimal fashion is the risk of the 
n step sampling procedure where after each sample is tested 
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the risk of continuing is compared with the risk of choosing 
a decision and the minimum of the two risks is chosen. After 
the first sample is taken, the risk of choosing a decision 
must be compared to the risk of continuing with the n-1 step 
sampling procedure. Thus as each sample is drawn the risk of 
continuing changes due to the change in the s amp 1 e number 
remaining as well as the new prior based on the samples 
observed. The above process may be viewed as a decision tree 
shown in figure 2 which depicts the sequential decision 
process for n=4. At each step (k) k = 0, 1, 2, ..., n the 
risk of making an immediate decision is denoted by p(<j> k ,d*) 
where <j>^ is the distribution of 0 based on k samples and is 
defined in equation (10). At each step this must be compared 
with the risk of continuing the sequential test process and 
the minimum risk action chosen. The risk of continuing at 
each step is denoted by pU k ,Xfc + i) wh ere x k+ i = C x k+ -| » x k+2 , 
..., x ] indicating the dependence on the current prior 

and the remaining samples. 

The difficulty alluded to earlier is in obtaining 
values for p ( >x k + i ) • The general solution procedure uses 
a backward induction starting at the last step and working 
backward to the first step to obtain the continuation risk at 
each step. For the n=4 case depicted in Figure 2 the proce- 
dure would be as follows. At step 3 after three samples had 
been observed the optimal action would be the minimum of 
P U 3 ,d*) and p ( <f> 3 , >L 4 ) * Where p(<j> 3 ,d*) is the risk of the 
optimal decision given <j> 3 as defined in equation (10) and 
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Figure 2. Sequential Decision Process. 



p(‘f , 3 ’— 4 ^ is the risk of obtaining an additional sample 
and then making the optimal decision. Since a decision must 
be made after x 4 is observed this is the same as the sampling 
risk for simultaneous sampling defined in equation ( 11 ) with 
x_ equal to x^. Thus the risk at step 3 is a function of <j > 3 
denoted by p ( ^ ) and would be: 

p(<j> 3 ) = min Cp ( <f > 3 > d* ) , pU 3 »x 4 )]. ( 12 ) 

The risk at step 2 will be the minimum of the decision risk 
pC^’d*) and the continuation risk p(<j> 2 ,x 3 ). The continuation 
risk p(<j) 2 ,x. 3 ) 1S the expected value of the risk at step 3 
based on the sample, x^. Thus, 



P ( $ 2 * 2 L 3 ) — ECp( < J ) 3 (x 3 ))] (13) 

j min[pU 3 (x 3 Xd*), p (i> 3 (x 3 ) ,x 4 )]f (x 3 )dx 3 + C 3 



where: ^ 3 ( x ^ ) is $2 given x 3 i.e. f 2 (s |x 3 ) 



f(x 3 ) = J f(x 3 |e)f 2 (e)d0 

0 

C 3 = expected cost of obtaining value x 3 

Thus at step 2, the optimal action again being that with 
minimal risk, the risk is 

p ( <}> 2 ) = roin[p($2»d*)» p ( <J> 2 » 2L3 ) ] • O 4 ) 
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This approach is then repeated for step 1 which yields 



p ( *j> i > x_2 ^ ~ ECp(i , 2^2^^ = 



( 15 ) 




pU 2 ( x 2)] f < x 2) dx 2 + C 2 



and 



p ( $ 1 ) ~ nii n[p ( <j>i , d* ) , p ( 4* i > 2^2 ^ ^ 



(16) 



Thus at step 0, the beginning of the procedure, the risk of 
the entire sequential test procedure would be 



where p ( ^ q » 2ii ) = £ Cp(^-| ( x i ))3 is the risk of the entire 
sequential sampling plan. 



risk of a sequential procedure is a non-trivial exercise. 

The degree of difficulty depending on the sample size n, the 
loss function L(e,d) and the sampling and parameter distribu- 
tions, f ( x | e ) and f ( e ) . Examples of the above procedure for 
sequential testing may be found in the open literature 
[1 , 2 and 3] . 

3 . Comparison of Sequential Versus Simultaneous Sampling 
In the design of a quality control procedure the 
method of sampling must be specified. In order to determine 
which of the two methods is preferred in a given situation 
the risks of the two procedures should be compared. In lieu 
of mitigating circumstances such as ease of implementation. 



pU Q ) = min[ P U 0 ,d*) , pUq.x^)] 



(17) 



As seen from the above discussion de.termining the 
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increased complexity, etc., the decision as to which sampling 
scheme, sequential or simultaneous, is optimal should be 
based on the risks of the two procedures. That is if p($*) 
is the risk of the optimal sampling scheme s* then 

p(s*)=min[p(Sq),p(s i )]. ( 18 ) 

where Sq = sequential sampling 

s. = simultaneous sampling 

In many cases the decision as to which sampling 
scheme is superior is obvious due to the nature of the test. 
For example, if the cost of testing is constant regardless of 
the number of units tested then simultaneous testing would 
provide minimum risk. If however the cost of testing were 
only a function of the number of units tested then sequential 
testing would be superior. It is when the cost of testing 
assumes some combination of the two extremes that the optimal 
choice becomes unclear, in which case the risks of each 
procedure must be compared to determine the optimal approach. 

B. APPLICATION TO THE GENERALIZED PRODUCTION PROCESS 

In order to gain insight to the use and requirements of 
the Bayesian decision method, the implementation of the 
procedure on the generalized process of Figure 1 will now be 
discussed for the simultaneous sampling case. 

At time t Q , t-j , 1 2 , ... the following assumptions are 

made . 
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1. A production sample of size from production 
lot Q t is available for testing. 

2. Samples are independent given and the sampling 
density f ( x | 9 1 ) is known. 

3. The cost of testing the n units, C n , is known. 

4. A loss function L(e,d) is specified for all deD 
and e e0 . 

5. At time t Q , prior to sampling, the distribution 
of 9 is known and denoted by f(0Q). 

6. The conditional distribution of 9 t+ -| given 

9 t’ 9 t-i» ••• known and denoted by f ( e 1 I 0 1 ) 
where a markov dependence is assumed for 
i 1 1 us trati on . 

At time t Q , prior to sampling, two actions are feasible. 
Either make a decision (accept or reject) or test the sample 
to gain information. If a decision is made without sampling 
the risk will be p(d*) as defined by equation (8). The risk 
of testing the sample x^ = (x-pXg, .... x p ) and making the 
optimal decision, p(<j>,x) is given by equation (11). Assume 
that sampling represents the minimal risk action. After the 
sample result is obtained, the prior f 7 ( 0 q ) must be revised 
and the optimal decision d* chosen. Denote the posterior or 
new prior based on the data sample by f ( 0 q 1 2l) , Then by Bayes 
theorem 



f ( 9 0 I — ) 



f (x| e 0 )f(9 0 ) 
fix) 
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The posterior f(0g|x.) is now used to determine the optimal 
decision d*, by equation (10) 



After the decision is made on lot Qg at tg the procedure steps 
to lot Q-| at t-j . In order to determine the appropriate 
actions concerning this lot the distribution f(e-j) must be 
obtained. It is at this point where the model of the produc- 
tion process is used. The relationship between 9g and e-j 
must be known in order to determine the density f(0-j) based 
on the posterier f(0g|x.). The relationship between 9g and 0 ^ 
is specified by the conditional density f ( 0 ^ | 0 g ) which is 
obtained from the model of the production process. Methods 
by which this density may be obtained from the production 
model are discussed in a later section. Given that f ( 0 -j | 0 g ) 
is known then f ( 0 -j ) prior to sampling from lot is obtained 
as follows: 



Using this value as the prior for 0 ^ ; p(d*) and p(<j>,'x) 
are obtained using equations (10) and (11) as before and the 
decision rule (9) is applied. If the decision rule indicates 
that the risk can be lowered by sampling, the procedure as 
outlined for 0g is followed. If however, no sampling is 
required then decision d* is made and the procedure advances 





(19) 



0 
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to t 2 . At t 2> "f ( 9 2 ) must be obtained based on f(9 0 |x_) since 
no samples were observed at time t.| . The density f(e 2 ) would 
be determined as follows: 

f ( 0 2 ) = J f ( 0 2 1 e 0 ^ f ( 0 O l^ de 0 (20) 

0 

where f(e 2 |e 0 ) = J f(e 2 l 0 -|) f( 0 -||e o ) d 0 -| • 

0 

After f(e 2 ) is determined p(d*) and p(<j>,x_) are obtained as 
before and the decision rule applied. The entire procedure 
is then repeated to determine if samples should be tested at 
^ 3 , t ^ , • • • » etc. 

Under the decision process described it may be possible 
that no sampling would be required for several production 
lots. At first thought this may seem contrary to the objective 
of minimizing the decision risk. If the production sequence 
9 o » 9-j , 9 2 ... is highly correlated then knowledge of one 
value of e implies considerable knowledge of succeeding (and 
preceding) values. The correlation is expressed by the density 
f(9^.|e^._i ) which is derived from the model of the production 
process. The decision process thus quantifies the feeling 
"When one lot is good the next one usually is good also." 

In order to apply the Bayesian method in loss function, 

L ( 9 , d ) , and the sampling and process densities, f ( x | 0 ) and 
f ( 0 t | e t _1 ) must be specified. These are the subject of the 
following sections. 
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III. LOSS FUNCTIONS 



The purpose of this section is to examine various ways 
in which the consequences of decisions can be related to the 
true value of quality. In the preceding section this rela- 
tionship was generally referred to as a loss function, L(e,d). 
As mentioned previously the loss when the best decision is 
made for a given value of 0 is equal to zero. The loss of a 
particular decision d when 0 = 0 is the difference between 
the consequences if d is chosen and the consequences if the 
best decision were chosen. The loss then essentially repre- 
sents a regret or opportunity cost. From the above definition 
it is seen that one characteri s ti c of loss functions is that 
they are non-negative. Since in the decision process the 
risk of sampling is added to the cost of testing the loss 
function and testing cost must be expressed in similar units 
(e.g., dollars). In the following examples it is assumed that 
the utility of money is linear over the range of interest. 

This assumption alleviates the otherwise necessary transforma- 
tion of the loss in dollars to utility. If the utility of 
money is continuous then at least to a first order approxima- 
tion the linear assumption is valid. In the following 
paragraphs several examples of loss functions are discussed. 
Their presences is not meant to imply that they are in any 
way the best or most useful loss functions. The loss function 
used for a particular process depends entirely on the 
situation at hand. 
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A. SYMMETRIC LINEAR LOSS FUNCTION 

The following example demonstrates that if the decision 
consequences are a linear function of 9 then the loss function 
will be linear and symmetric about their intersection. Let 
R ( 9 » d . ) represent the consequences of decision d.. when 9 
obtains. For the linear case 



Equations (21) and (22) are plotted in figure 3. 

If the consequences are viewed as a cost then the best 
decision is that which minimizes R(9,d) for all values of 9 
Thus for 9 <_ 9q decision d 2 is best and d-j is best for 
9>_9q. The loss L ( 0 , d -j ) is 



R ( 6 , d 1 ) = a, 9 + b 



( 21 ) 



R(9,d 2 ) = a 2 9 + b 2 ,a 2 >a-|. 



( 22 ) 



a.j 9 + b-j - ( a 2 9 + b 2 ) 0 < e Q 



L ( 0 » d , ) = 



0 



9 > 9 



0 



or 




0 



(b,-b 9 ) 



From (21) and (22) e Q = ( a2 . ai ; 



Thus 





( 23 ) 
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R(e,d) 




Figure 3. Decision Consequences. 
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By the similar calculations: 

( 0 9 1 0 O 

L(9,d 2 ) = l (24) 

( Ca 2 -a 1 ) ( e-e 0 ) e > e Q 

Equations (23) and (24) are shown in figure 4 

As evidenced by equations (23) and (24) and depicted in 
figure 4 the loss functions of linear consequences are linear 
and symmetric about the intersection. 



B. QUADRATIC LOSS FUNCTIONS 

The quadratic loss function is defined as 

2 



L ( Q » d ) = 



( C 1 ( 9_9 0^ 



9 < 0 



0 



0 



9 1 9 0 



(25) 



( 0 9 < 9 0 

L(9,d 2 ) =<! 2 (26) 

/ C 2 ( 9 o -9 ) ® > ®g 

A heuristic justification for this general form for a 

loss function can be made as follows. Assume for a particular 

problem that the loss function L(e,d) and all its derivitives 

exist. Let 0q represent the dividing line between acceptable 

and unacceptable quality. If e > 9 q let d^ be the proper 

decision and if e < 9g let d 2 be the proper decision. Thus 

L C 0 > d i ) = 0 for 9 > 9g and L(e,d 2 ) = 0 for 9 < 9 Q . Define 

L(e) = max L C 9 » d ) then L C 9 ) = L(e,d-,) for e < 9 n and L(e) = 
.deD I u 

L C © > d 2 ) for 9 >_ 0q. L C 9 ) can be expressed as a Taylor series 
expension about e Q as follows: 
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Figure 4. Linear Loss Functions. 
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L 11 ( 0 ) 

1(e) = L ( e o ) + L'(e 0 )(e-e 0 ) + — g - 0 - (Q-e 0 ) 2 + ... (27) 

If 9 = 0 q then either decision or d 2 is optimal and the 
loss is zero. This implies that L(e Q ) = 0. If L(e Q ) is 
zero then 9 g is a minimum for L(e) which implies that 
L ’ ( © q ) = 0. Further if L(9g) is a minimum then L"(9 q) must 
be non-negative. 

Applying these results to the Taylor series expension 
(27) yields: 



L"(e n ) 

L ( e ) = — 2 ( 0 _ 0 q ) 



L'"( a Q ) , 

3! ( 9 ' e 0^ + 



Thus the loss function for 9 close to 9g can be approximated 
by: 



Ke.d^ 



C 1 ( 0-0g ) 2 0 < 9, 



9 — 9 0 



(28) 



L(e,d 2 ) 



9 1 9 0 



€2(0“ 0g) 0 > 9 ( 



(29) 



which is the quadratic loss function originally defined. 
Instead of specifying the indifference value 9g two values 
9 1 , 0 2 could be specified. Where 9-j would represent minimum 
acceptable quality and 9 2 would represent the maximum 
rejectable quality. Then for d-j = accept and d ? = reject the 
loss function could be expressed as follows. 
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L(e.d 1 ) 



Cl ( 9 “ ® q ) 0 JL 0 ] 

0 0 > 0 1 

0 0 1 e 2 

L C 9 » d 2 ) = ( 

(c 2 (0-0 o ) 2 0 > 0 2 

C. CONSTANT LOSS FUNCTION 

The constant loss function represents the case where 
the loss is independent of the value of 0 over a specified 
range. The constant loss function could be represented as 
fol 1 ows : 





(1 


O 

CD 

v | 

CD 






L( 0 , d i ) 


= 




(30) 


to 


CD 

V 

CD 

o 






L ( 0 , d 2 ) 


4 ° 


o 

CD 

V 

CD 




(31) 
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CD 

a| 

CD 






The expected 


loss or risk 


, p ( d ) , for decisions d ^ 


and 


d 2 woul d be : 


> 








p ( d 1 ) = 


J 1 

— 00 


• f ( 0 ) d 0 = 


OQ. 

II 

O 

CD 

V 

CD 

' — ^ 

CL 




II 

C\J 

"O 

Q. 


oo 

/ 1 


. f ( 0 ) d0 = 


P ( 0 > 0q ) ~ a 






e 0 
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The risk for decision is the probability that d^ 
is the correct decision and the risk for d 2 is the probabil- 
ity that d-j is the correct decision, p(d-j) and p(d 2 ) are 
usually referred to as the probability of type II and type I 
errors respectively and denoted by 6 and a. It can be 
shown [1] that the decision function 5 ( x ) which minimize the 



risk has the form. 



If 



•f 1 (x) 

f 2 U) 



< 



C 



then 



reject . 



where 

f ] (x) = f(x|e 0 ) 

f 2 (x) = f (x | e ) 



and 9 is the maximum likelihood estimate for 9 based on x_. 

This decision function is the generalized likelihood 
ratio criteria upon which classical hypothesis testing is 
based. Thus quality control procedures based on classical 
hypothesis testing imply that the loss functions of (30) 
and (31) are operative. 

This concludes the discussion of the most common types 
of loss functions. The next section considers the problem 
of formulating a statistical model of the production process. 
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IV. THE PRODUCTION MODEL 



As mentioned previously in order to apply the Bayesian 
decision method the conditional density f(0^|e -j) must be 

known or estimated. In order to specify this density the 
quality control specialist must specify the underlying 
mechanisms which determine how the production process is 
evolving over time. In the following models two assumptions 
wtll be made: (a) that the process is determined by a 

specific underlying relationship plus random disturbances 
and (b) that by the central limit theorem, the ghost of 
LaPlace or some other incantation, the disturbances are 
assumed to be normally distributed with known mean and 
variance. In the following paragraphs three models which 
might be used to characterize a production process are 
described. They are the linear trend model, the autoregres- 
sive model and the periodic model. The generalized linear 
model is also presented. In addition to the process models 
a typical observation model will be included for completeness. 
For convenience and to aid in the later development of the 
solutions for the generalized linear model it is also assumed 
that the observation errors are normally distributed. 
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A. LINEAR TREND MODEL 

The linear trend model represents a production process 
where the underlying trend is a linear increase or decrease 
in the character i s t i c e. Let 6^ represent the change of e 
from t-1 to t and represent the observations at time t. 
Then the observation and process models could be described 
as : 



OBSERVATION: 


x t = a t + e t e t ~N(0,o x 2 ) 


(32) 


PROCESS: 


0 1 = 0 1 _ i + 6 t + w t w t ~N(0,a 2 ) 


(33) 



In the above model e^ represents observation noise and 

represents the process noise causing de vi ati ons f rom the 

linear relationship. From the model, the conditional density 

f* ( 0 1 1 0 1 -|) would be normal with mean = E ( 9 1 ) = 9 t _-| + <5^ and 

2 

variance equal to a . In order to apply the Bayesian proce- 

2 

dure the increment 6^. and the variance a must be known or 
estimated from the process. If the uncertainty in is 
incorporated in the model the result is 

0 t = 0 t-l + 6 t + w t w t~ N ( 0 , ct 2 ) 

6 t ~ N ^ y R ,a R ^ 

2 2 

then f(e t |e t _ 1 ) ~ N ( 9 1 _ -j + u R , a R + a ) assuming 5 t and w t 
are uncorrel ated . 



\ 
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B. AUTOREGRESSIVE MODEL 

The basic autoregressive results from the following 
assumptions about the production process. 

1. f(e t ) ' N( 0 ,a 2 ) y t 

2 2 
a * pa* 

2. f(e t .e t+1 ) - N(O.C) Vt, c = ( pa 2 0 2 ) 

Assumption 1 indicates that at any time t the uncertainty 
in the location of 9 ^ can be expressed as a normal probability 
density about the overall process mean (assumed to be zero 
in this case) with process variance a common for all t. 
Assumption 2 indicates that the values of 9 at successive 
times are not independent and their joint density is 
bivariate normal with correlation coefficient p. The observa- 
tion and process model for the autoregress i ve process can 
be expressed as: 

OBSERVATION: x t = 9 t + e t e t ~ N(0,a x 2 ) (34) 

PROCESS: 9 1 = P 9 t _ 1 + w t w t ~ N ( 0 , a 2 ( 1 -p 2 ) ) (35) 

This model is useful when there appears to be no under- 
lying trend either up or down in the process and the quality 
of successive lots appears to be highly correlated. From 
the above model 

f ( 6 1 1 6 1 _ -| ) ' N(p9 t _i ,a 2 (l-p 2 )) 
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C. PERIODIC MODEL 

If the production process behaves in periodic fashion 
over some internal T then a periodic model is appropriate. 

Let e(t) represent the periodic function which describes the 
production process and let the average value of the periodic 

1 f T 

function be zero (i.e., y I e(t)dt = 0). Then the periodic 

1 J 0 

function e(t) can be approximated by the Fourier series as 

n ? 

0 ( t ) = Z (a k C0Sku, Q t + b k S I N kto t ) , u = =j- 
k- 1 

where the coefficients a^ and b^ are unknown and subject to 
disturbances. The value of n being large enough to make the 
approximation valid. 

Let e T = ( a -j a 2 ••• **• ^ n ) 

and B T ~ (cosu^t cos2u) t ... cosna) t sinu rt t ... sinnw t) 

0 0 0 0 0 

Then the observation and process models are: 

OBSERVATION: x t = B^e t + e t e t ~ N(0,a x 2 ) (36) 

PROCESS: 8 t = 0 t _ + w t w t ~ N(0,z) (37) 

where 0^ is a 2nxl vector of zeros and z = E[w t w^]. 

D. GENERAL LINEAR MODEL 

The above models represent special cases of the general- 
ized linear model incorporating special features to reflect 
particular characteri s ti cs of the production process. The 
generalized linear model is defined as follows: 
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Let x = nxl vector of observations at time t 

e t = [oxl vector of process parameters at time t 
A-j = nxp matrix characterizing the observation 
= P X P matrix characterizing the process 
e^ = nxl vector of observation noise at time t 
u> t = pxl vector of process noise at time t 
E[e t ] = E[w t ] = 0 
VAR (e t ) = E[e t eJ] = C ] 

VAR (w t ) = E[w t w[] = C 2 

Then OBSERVATION x_ t = A ] 8 t + e fc e t ~ N ( 0^, C ) (38) 

PROCESS 9 = A 2 9 t _ 1 + w t w ~ N(0,C 2 ) (39) 

is the generalized linear model. 

In order to implement the Bayesian decision procedure 
various probability densities are required to determine the 
risks of alternative actions. From section II it can be seen 
that th ree dens i ti es must be obtained in the course of the 
procedure. In order to obtain the risk of immediate decision 
without sampling the prior distribution of e,f(e) must be 
obtained. To obtain the risk of sampling the prior distribu- 
tion of x_, f(x_) and the conditional distribution of 9 given a 
sample x_, f(9|x_) must be obtained. In order to determine the 
optimal decision after sampling f ( 9 | x_) is required. Thus to 
implement the decision procedure three densities must be 
obtained at each step in the process. If the observation and 
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production process can be modeled by the generalized linear 
model of (38) and (39) the required densities can be obtained 
as follows [4]. 

Let: f(9^) the density of 0 at time t prior to 

sampl i ng 

f(e t |_x) be the posterior of 8 at time t based 
on the sample _x 

f(x^) be the sample distribution prior to sampling 

Also let the distribution of e at t-1 be N(y,z) 

Then from (38) and (39) 

f ( 9 1 ) ' N(A 2 u,A 2 eA 2 T + C 2 ) (40) 

f(x t ) - N(A 1 A 2 p > A i( A 2 zA 2 T + C 2 )A } T + (41) 

f ( e 1 1 x t ^ obtained as follows (where the matrices for which 

inverses are needed are assumed non-singular). From Bayes 
theorem f(0^|>L t ) a f(x| 9 t) f(9^-) 

From the model f(x t l 9 t ) ~ N(A-|0 t> C-j) 

Thus f(0 t |x t ) a e"^ 

where Q = (x_ - A -j 0 )^ C^(_x-A-|0 ) + 

(0-A 2 y) T (A 2 ZA 2 T + C 2 ) -1 (9-A 2 u) 

the t subscripts being deleted for convenience, collecting 
the 0 terms yields: 
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Q = 9 T (a + A 1 T C 1 " 1 A 1 )e -2 (x T C 1 ' 1 A 1 + u T A 2 T a)0 

+ x T C •] ] x + u"^A 2 ^"aA 2 U 

where A = (A 2 eA 2 T + C 2 ) _1 
completing the square results in: 

Q = ( 0 - Dd ) T D”^ ( 0 -j - D d ) + [ji T C -j ~ 1 21 + u T A 2 T AA 2 u - d T D T d] 

Thus f(9 t |x t ) ~ N ( Dd , D ) (42) 

where D' 1 = ( A 2 S A 2 T + C 2^ _1 + A 1 Tc i" 1a 1 

d = A i ^ C i ^ x_ + ( AglAg^" + ^2^ ^ 2 ^ 

Thus with the aid of the generalized linear model the required 
densities for complex multidimensional production processes 
can be evaluated in a s trai ghtforward manner using (40), (41) 
and ( 42 ) . 

Equations (40)and (41) represent the one step ahead 

predictive distributions of 0 and x based on the prior at t-1. 

As discussed in section II, if no sampling is performed at 

some t then the predictive distribution for 0 at t+1 based on 

the prior at t-1 is required. In general, a method is needed 

TH 

to obtain the k step ahead predictive distributions of 0 
and x . 

TH 

Let f(e.) ~ N ( y £ , E £ ) then the k step ahead distribution 
f ( 0 t + k ) ~ N ^ w t + k’ z t+k^ can be obta '' nec, recursively from the 
linear model as follows: 
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y t+k = A 2 y t+k-l ’ k = 0,1 ,2 



(43) 




(44) 



Let f(x. t+k ) ~ N(m t+k ,C t + k 



TH 

) be the k step ahead predictive 



dfstribution for the sample x* + k then from (43) and (44) and 



For an example of the use of the linear model and the 
risk calculations the reader is referred to Appendix A. As 
an interesting aside, the recursion relationships developed in 
(42) thru (46) are identical to the results obtained using 
Kalman filtering. 





(46) 
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V. INCORPORATING SUBJECTIVE INFORMATION 



One of the primary advantages of the Bayesian decision 
procedure is its capacity to incorporate subjective informa- 
tion into the decision process. Subjective information can 
be incorporated into the decision process by either of two 
routes, either by revisions to the process model or by 
altering the prior distribution of 9 . The method chosen 
depending on which more accurately reflects the subjective 
information. Examples of how subjective information may be 
used will be discussed with respect to the generalized linear 
model which is repeated here for reference 

OBSERVATION: x t = + e t , e t ' N(0, C ] ) 

PROCESS: 9 £ = A ^ Q t - 1 + w t* w t ~ N(0_, 

As an example of the use of subjective information, suppose 
that the autoregressive model of section IV is being used to 
model the production process and production appears to be 
fairly stable (i.e., no trends). You are informed that 
starting with the next production lot three engineering 
changes will be incorporated into the units. It has been your 
experience that whenever more than one engineering change is 
incorporated that the production quality is momentarily 
reduced and then increases with successive lots as the new 
procedures are learned and the inspectors gain experience. 
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How might this subjective information be incorporated into 
the decision process? One approach would be to adjust the 
prfor for the first change lot by lowering the mean to 
reflect the anticipated decrease in quality and increasing 
the variance to reflect the associated uncertainty as to the 
actual process value. Increasing the variance will have the 
effect of weighting the new sample data more heavily in 
determining the posterior. After adjusting the prior to 
reflect the anticipated decrease in quality the autoregres- 
sive model might be replaced by the linear trend model to 
reflect the anticipated increase of quality with successive 
lots as a result of the learning process. The rate of 
increase <$ t in the linear model could be changed for each lot 
to provide a linear approximation to the anticipated learning 
curve. As the process quality returned to its original 
level the autoregressive model would again be used. 

This example illustrates two important features of the 
Bayesian decision procedure and the use of the linear model. 
First, when using a linear model it is not necessary that 
A i , A 2 , C-| and C 2 be constant for all t only that they be 
known at time t and thus the parameters of the linear model 
are free to change as required by the process being modeled. 
The second feature of the Bayesian procedure illustrated in 
the example is adaptability. By changing the model structure 
or the prior to reflect uncertainty in the process, the 
information requirements (sample data) of the procedure 
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adapt to reflect these changes. In order to maintain the 
same risk more samples will be required if the variance of 
the prior is increased to reflect uncertainty. Thus unlike 
traditional quality control procedures where the same sampling 
and decision procedure is used the Bayesian method can adapt 
to reflect the changing requirements of the production 
process. As another example of adaptability consider the 
autoregressive model. Because of the high correlation from 
one lot to the next the method reduces the sampling required 
when quality is either very good or very poor, thus taking 
advantage of the natural excursions of the output quality. 

The adaptability feature of the Bayesian method also 
has another interesting property. It indicates where, when 
and the quantity of quality control resources to be used. 

This is especially important when trying to maximize the 
effectiveness of the quality control function on fixed or 
limited resources (i.e., labor, test facilities, etc.). 
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APPENDIX A 



A BAYESIAN DECISION EXAMPLE 

The following example is provided to illustrate how the 
Bayesian method is applied and the required risks are 
obtained. It is assumed that an autoregressive model is used 
to represent the production process and that simultaneous 
sampling is used. 

Let: OBSERVATION MODEL: x t = 6 t + e t> e t ~ N(e,c x 2 ) 

PROCESS MODEL: e = P 0 t _ 1 + w t 

w t ' N(0,a 2 (l-p 2 )) 

and the loss function is: 

L C 9 » d i ) = exp[-(9-0 o )] , - co <_ 0 <_ » 

L ( 0 , d 2 ) 3 exp[ - ( 0 q- 0 ) ] , - 00 l 9 £ » 

where : 0 q is a known constant. 

2 

Also let f ( 0 1 _ i ) " N(y t _.| ,a t _i ) be the density of 0 at time 
t-1 . 

From section IV the correspondence between the auto- 
regressive model and the generalized linear model is: 

2 2 2 

A-| = 1, A 2 = p, C ] = a x , and C 2 = a (1-p ). Thus by 
equation (40) and (41) making the above substitutions: 
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f 




f(0^) ~ p 2 a^_ + (l-p 2 )a 2 ) 

f(x) ~ N(pp t _i , P^cr t _-j^ + (l-p 2 )a 2 + o x 2 ) 
let y^ = pp t _i and cr^ 2 = p^a^_-j 2 + (l-p 2 )o 2 
then f(9 t ) ~ N(y t ,a t 2 ) (1) 

V- 

f (x) “ N(v t> a t 2 + a x 2 ) (2) 

To determine the risk of making an immediate decision from 
equation (10) 

a 2 (3) 

p(d-j) = J L ( 9 , d -j ) f ( 9 1 ) d 9 1 = exp[-| (y t - 9 Q )] 



and 



p ( d 2 ) - J' L ( 9 j d 2 ) f(9^)d9^ = exp[ ^ + ( y . 



-e 0 n 



(4) 



Equations (3) and (4) are plotted in figure A-l as a function 
of the mean y^. 

Since p(d*), the risk of the optimal decision, is equal to 
the minimum risk from figure A-l it is observed that: 



p(d *) J »< d 2> ”t * 9 0 

I P ( d i ) Uj. >_ 9g 

which implies the following decision rule: 



(5) 



d* 



( d 2 t* 

)d 1 if 



w t < 9 o 
v t - 9 0 
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Figure A-l . Decision Risk. 
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By symmetry tile rtsk of decision d* is 

2 

p(d*) = exp[— - |y-0g|] 

In order, to determine the risk of sampling and then making 
a decision d* the posterior f ( © t ,1 2i) roust be obtained. The 
process of testing a sample of size n parameterized by a 
fixed but unknown value 0 1 can be viewed as: 

x i = 9 i + e i e ~ N(0,a x 2 ) 

9^ — 9, j_] ^ — ^ ^ > • • • » n 

where the samples are assumed independent. Applying the 

2 

linear model: A-j = 1 , A 2 = 1 , C -| = a x , and C 2 = 0. 

By repeated application of equation (40)one obtains: 




1 

where: x„ = — z x. 

n n i 

i = 1 

From section II the risk of obtaining an additional sample 
and then choosing d* is: 




J p (e ,d)f ( 0 | x)de} f(x)dx 
0 



( 6 ) 
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the expression in brackets is p(d*|x_) that is, the risk of 
d* given a sample result x_. p(d-||x_) and p(d 2 |x) may be 

obtained by substituting f(e t |x_) for f ( e t ) in equations (3) 
and (4). This substitution yields: 

P ( d i |x) = expect 2 - ( 6 - 0 0 ) ] 

P (d 2 |x) = exp[^-a 2 + ( 6 - e o ) ] 



where 



2 a x 



a = 



t n 



2 °x 
CT t + ~ 



6 = 



- 2 x 

x n a t + y t rT 



2 x 
a t + “ 



From equation (5) 



p ( d* | x ) 



(d 2 |x) , e < 0 , 
(d 1 |x) , 6 >_ 9 ( 



Equation 6 becomes: 
x 

p ' " ' 



A c 

(x) = J p ( d 2 | x n ) f(x n ) dx n + / p(d 1 |x n ) f(x n )dx n (7) 



CT x ( 9 0 TP t ) 

where: = o + 9 

c no. 



2 ■ u 0 and f(x n ) - N(p, n t 2 +-^- ) 



Solving ( 7 ) yields 



p (x) = p ( d-j ) 5 



x c +a t “ y - 






2 J + P ( d 2 ) $ 



x c ~ CT t • y 

~ T? 

2 X x 
a + + — — 

t n 



(8) 
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where: p(d-|) and p(d 2 ) are as defined in (3) and (4) 

$(•) = P(z< • ) , z f ~N ( 0 , 1 ) 

|(. ) = 1 -$(. ) 

Based on equations (5) and (8) the usual decision rule is 
applied. If p ( d * ) > p(x.) + E [ C ( x ) ] take another sample 
otherwise make decision d*. 

T H 

To determine the mean and variance of the k step ahead 

2 

predictive distribution u t+k and o t+k equations (43) and (44) 
are used recursively to obtain the following results. 

“t + k a (9) 

°t+k = p2k ° t 2 + p2 ( 1 -P 2k ) (10) 

TH 

The predictive distribution for k step ahead sample 
parameterized by m t+k and C t+k by (45) and (46) are: 

m t+k " y t+k 

2 2 
C t+k = CT t+k + CT x 

In order to examine the behavior of the posterior of e, 
f(e|.x) as the sample size is increased observe that 



1 i m f ( 9 1 1 x_) = 1 i m N 
n-><» n->°° 
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Thus ltm f (e | x) N C Q t » 0 ) 



n->°° 



Since x n is a consistent estimator of 9^. This implies that 

as n increases the knowledge of 9^ as represented by f(e t |x) 

becomes 'perfect" in the sense that the variance approaches 

zero and is then known. The variance decreases approxi- 

mately as — for n >> — 

CT t 

TH 

The k step ahead prediction distribution, f(9 t+ ^) of 
(9) and (10) represents the uncertainty in 9 t + t( based on 
information up to and including time t. The lim f ( 0 1 + ^ ^ 

k^-oo 

N (0 , a ) which is the distribution of the process before any 
information is obtained. Thus as k increases the information 
obtained at time t loses its "value" in predicting the 
location of the process at t+k. The rate at which previous 
knowledge is discounted is a function of p the correlation 
between Q t+ i and 9^ which in this example was assumed 
constant for all t . 
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